Extending the definition of beta-consistent biclustering for feature selection
نویسنده
چکیده
Consistent biclusterings of sets of data are useful for solving feature selection and classification problems. The problem of finding a consistent biclustering can be formulated as a combinatorial optimization problem, and it can be solved by the employment of a recently proposed VNS-based heuristic. In this context, the concept of β-consistent biclustering has been introduced for dealing with noisy data and experimental errors. However, the given definition for β-consistent biclustering is coherent only when sets containing non-negative data are considered. This paper extends the definition of β-consistent biclustering to negative data and shows, through computational experiments, that the employment of the new definition allows to perform better classifications on a well-known test problem.
منابع مشابه
A New Heuristic for Feature Selection by Consistent Biclustering
Given a set of data, biclustering aims at finding simultaneous partitions in biclusters of its samples and of the features which are used for representing the samples. Consistent biclusterings allow to obtain correct classifications of the samples from the known classification of the features, and vice versa, and they are very useful for performing supervised classifications. The problem of fin...
متن کاملBiclustering and Feature Selection Techniques in Bioinformatics
The paper describes several data mining techniques, developed to solve problems which are faced by biologists in Bioinformatics.Several biclustering algorithms which perform clustering on the two dimensions simultaneously are described. Other techniques described in this paper include feature selection methods which help in reducing noise and improving the performance of the classification model.
متن کاملApplying Feature-Selection Algorithm to Predict Landslide in the Southwest of Iran
Extended abstract 1- INTRODUCTION Nowadays people have an increased sensitivity towards landslides especially in mountainous areas using change in the land use and the expansion of communication networks (Gvrsysky et al., 2006). In the twentieth century, Asia has allocated the highest incident of landslides (220 landslides). Latin America has had the highest number of casualties (more than 2,...
متن کاملModels and Issues in Consistent Biclustering
Biclustering is a methodology allowing simultaneous partitioning of a set of samples and their features into classes. Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. The notion of consistency for biclustering is defined using interrelation between centroids of sample and feature classes. Consis...
متن کاملFeature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets
Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach using GA-based on feature selection and PS-classifier. The results of experiment show that the proposed model is comparable to the other models on Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011